flow modeling
Potential failures of physics-informed machine learning in traffic flow modeling: theoretical and experimental analysis
Lei, Yuan-Zheng, Gong, Yaobang, Chen, Dianwei, Cheng, Yao, Yang, Xianfeng Terry
Potential failures of physics-informed machine learning in traffic flow modeling: theoretical and experimental analysis Yuan-Zheng Lei a, Yaobang Gong a, Dianwei Chen a, Yao Cheng b, Xianfeng Terry Yang* a a University of Maryland, College Park, MD 20742, United States b Florida Atlantic University, Boca Raton, FL 33431, United StatesAbstract This study investigates why physics-informed machine learning (PIML) may fail when it comes to macroscopic traffic flow modeling. We define failure as the case where a PIML model underperforms both its purely data-driven and purely physics-based counterparts by a given threshold. Our analysis shows that physics residuals themselves do not inherently hinder the optimization of the loss function, which is a main reason responsible for the failure of the PIML model in other fields. Instead, successful parameter updates require both machine-learning and physics gradients to form acute angles with the true gradient. Our experiment shows that this condition may be hard to achieve for PIML under a general low-resolution loop dataset. In particular, when the traffic data resolution is low, a neural network cannot accurately approximate density and speed, causing the constructed physics residuals, already affected by discrete sampling and temporal averaging, to lose their ability to reflect the actual PDE dynamics. This degradation can directly lead to PIML failure. From a theoretical standpoint, we show that although the exact solutions of the LWR and ARZ models are weak solutions, for piecewise C k initial data and under mild conditions, the solutions remain C k on the complement of the shock set over finite time, with only finitely many shock waves, where C k refers to k times continuously differentiable. Since the shock set has Lebesgue measure zero, the probability of a detector measurement or auxiliary collocation point lying exactly on a discontinuity is essentially zero; asymptotically, every auxiliary point admits a sufficiently small smooth neighborhood where the physics residual is well-defined and valid. Consequently, the well-known limitation that MLPs cannot exactly represent non-smooth functions does not materially affect our setting, as the residual evaluation almost always occurs in smooth regions. We also investigate the error lower bounds of the MSE of physics residuals for PIML models under high-resolution data. We prove that higher-order models like ARZ possess strictly larger consistency error lower bounds than lower-order models like LWR under mild conditions.
Maximum Score Routing For Mixture-of-Experts
Dong, Bowen, Fan, Yilong, Sun, Yutao, Li, Zhenyu, Pan, Tengyu, Zhou, Xun, Wang, Jianyong
Routing networks in sparsely activated mixture-of-experts (MoE) dynamically allocate input tokens to top-k experts through differentiable sparse transformations, enabling scalable model capacity while preserving computational efficiency. Traditional MoE networks impose an expert capacity constraint to ensure GPU-friendly computation. However, this leads to token dropping when capacity is saturated and results in low hardware efficiency due to padding in underutilized experts. Removing the capacity constraint, in turn, compromises load balancing and computational efficiency. To address these issues, we propose Maximum Score Routing ($\mathbf{MaxScore}$), a novel MoE routing paradigm that models routing as a minimum-cost maximum-flow problem and integrates a SoftTopk operator. MaxScore resolves the fundamental limitations of iterative rerouting and optimal transport formulations, achieving lower training losses and higher evaluation scores at equivalent FLOPs compared to both constrained and unconstrained baselines. Implementation details and experimental configurations can be obtained from $\href{https://github.com/dongbw18/MaxScore.git}{MaxScore}$.
Online Workshop on AI & IoT for Flow Modeling by IISc Bangalore [Nov 20]: Registrations Open - Noticebard
The Indian Institute of Science (IISc), Bangalore is organizing an Online Workshop on AI & IoT for Flow Modeling on November 20, 2020, from 11 am to 2 pm. The Indian Institute of Science (IISc) is a public, deemed, research university for higher education and research in science, engineering, design, and management. It is located in Bangalore (Bengaluru), in the Indian state of Karnataka. The workshop is organized as part of the Indo-Dutch project, "Digital Twins for pipeline transport networks". The aim of the project is to develop a digital twin that connects sensor data and advanced fluid solvers in order to detect possible leakage of fluid from the pipeline in real-time.